55 research outputs found

    An Ensemble-based Approach to Click-Through Rate Prediction for Promoted Listings at Etsy

    Full text link
    Etsy is a global marketplace where people across the world connect to make, buy and sell unique goods. Sellers at Etsy can promote their product listings via advertising campaigns similar to traditional sponsored search ads. Click-Through Rate (CTR) prediction is an integral part of online search advertising systems where it is utilized as an input to auctions which determine the final ranking of promoted listings to a particular user for each query. In this paper, we provide a holistic view of Etsy's promoted listings' CTR prediction system and propose an ensemble learning approach which is based on historical or behavioral signals for older listings as well as content-based features for new listings. We obtain representations from texts and images by utilizing state-of-the-art deep learning techniques and employ multimodal learning to combine these different signals. We compare the system to non-trivial baselines on a large-scale real world dataset from Etsy, demonstrating the effectiveness of the model and strong correlations between offline experiments and online performance. The paper is also the first technical overview to this kind of product in e-commerce context

    Remote Work Optimization with Robust Multi-channel Graph Neural Networks

    Full text link
    The spread of COVID-19 leads to the global shutdown of many corporate offices, and encourages companies to open more opportunities that allow employees to work from a remote location. As the workplace type expands from onsite offices to remote areas, an emerging challenge for an online hiring marketplace is how these remote opportunities and user intentions to work remotely can be modeled and matched without prior information. Despite the unprecedented amount of remote jobs posted amid COVID-19, there is no existing approach that can be directly applied. Introducing a brand new workplace type naturally leads to the cold-start problem, which is particularly more common for less active job seekers. It is challenging, if not impossible, to onboard a new workplace type for any predictive model if existing information sources can provide little information related to a new category of jobs, including data from resumes and job descriptions. Hence, in this work, we aim to propose a principled approach that jointly models the remoteness of job seekers and job opportunities with limited information, which also suffices the needs of web-scale applications. Existing research on the emerging type of remote workplace mainly focuses on qualitative studies, and classic predictive modeling approaches are inapplicable considering the problem of cold-start and information scarcity. We precisely try to close this gap with a novel graph neural architecture. Extensive experiments on large-scale data from real-world applications have been conducted to validate the superiority of the proposed approach over competitive baselines. The improvement may translate to more rapid onboarding of the new workplace type that can benefit job seekers who are interested in working remotely

    A Privacy-Preserving Hybrid Federated Learning Framework for Financial Crime Detection

    Full text link
    The recent decade witnessed a surge of increase in financial crimes across the public and private sectors, with an average cost of scams of $102m to financial institutions in 2022. Developing a mechanism for battling financial crimes is an impending task that requires in-depth collaboration from multiple institutions, and yet such collaboration imposed significant technical challenges due to the privacy and security requirements of distributed financial data. For example, consider the modern payment network systems, which can generate millions of transactions per day across a large number of global institutions. Training a detection model of fraudulent transactions requires not only secured transactions but also the private account activities of those involved in each transaction from corresponding bank systems. The distributed nature of both samples and features prevents most existing learning systems from being directly adopted to handle the data mining task. In this paper, we collectively address these challenges by proposing a hybrid federated learning system that offers secure and privacy-aware learning and inference for financial crime detection. We conduct extensive empirical studies to evaluate the proposed framework's detection performance and privacy-protection capability, evaluating its robustness against common malicious attacks of collaborative learning. We release our source code at https://github.com/illidanlab/HyFL .Comment: PETs prize challenge versio

    Redefining cardiac biomarkers in predicting mortality and adverse outcomes of inpatients with COVID-19

    Get PDF
    The prognostic power of circulating cardiac biomarkers, their utility and pattern of release in coronavirus disease 2019 (COVID-19) patients have not been clearly defined. In this multi-centered retrospective study, we enrolled 3,219 patients with diagnosed COVID-19 admitted to 9 hospitals from December 31, 2019 to March 4, 2020, to estimate the associations and prognostic power of circulating cardiac injury markers with the poor outcomes of COVID-19. In the mixed-effect Cox model, after adjusting for age, gender and comorbidities, the adjusted hazard ratios of 28-day mortality for high-sensitivity cardiac troponin I (hs-cTnI) was 7.12 (95%CI, 4.60-11.03; P<0.001), NT-proB-type natriuretic peptide (NT-proBNP) was 5.11 (95%CI, 3.50-7.47; P<0.001), CK-MB was 4.86 (95%CI, 3.33-7.09; P<0.001), myoglobin was 4.50 (95%CI, 3.18-6.36; P < 0.001), and CK was 3.56 (95%CI, 2.53-5.02; P < 0.001). The cutoffs of those cardiac biomarkers for effective prognosis of 28-day mortality of COVID-19 were found to be much lower than for regular heart disease at about 49% of the currently recommended thresholds. Patients with elevated cardiac injury markers above the newly established cutoffs were associated with significantly increased risk of COVID-19 death. In conclusion, cardiac biomarker elevations are significantly associated with 28-day death in patients with COVID-19. The prognostic cutoffs for of these values might be much lower than the current reference standards. These findings can assist better management of COVID-19 patients to improve outcomes. Importantly, the newly established cutoff levels of COVID-19 associated cardiac biomarkers may serve as useful criteria for the future prospective studies and clinical trials
    • …
    corecore